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Abstract 

Clarkson's algorithm is a two-staged randomized algorithm for solving lin- 
ear programs. This algorithm has been simplified and adapted to fit the 
framework of LP-type problems. In this framework we can tackle a number 
of non-linear problems such as computing the smallest enclosing ball of a 
set of points in M*^. In 2006, it has been shown that the algorithm in its 
original form works for violator spaces too, which are a proper general- 
ization of LP-type problems. It was not clear, however, whether previous 
simplifications of the algorithm carry over to the new setting. 
In this paper we show the following theoretical results: (a) It is shown, 
for the first time, that Clarkson's second stage can be simplified. (6) The 
previous simplifications of Clarkson's first stage carry over to the violator 
space setting, (c) Furthermore, we show the equivalence of violator spaces 
and partitions of the hypercube by hypercubes. 

Keywords: Clarkson's Algorithm, Violator Space, LP-type Problem, Hy- 
percube Partition 



1 Introduction 

Clarkson's algorithm. Clarkson's randomized algorithm [T] is the ear- 
liest practical linear-time algorithm for linear programming with a fixed 
number of variables. Combined with a later algorithm by Matousek, Sharir 
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and Welzl [6], it yields the best (expected) worst-case bound in the unit 
cost model that is known today. The combined algorithm can solve any 
linear program with d variables and n constraints with an expected number 
of 0{(Pn + exp(0(v^(i \ogd))) arithmetic operations [3]. 
Clarkson's algorithm consists of two primary stages, and it requires as 
a third stage an algorithm for solving small linear programs with 0{d'^) 
constraints. The first two stages are purely combinatorial and use very little 
problem-specific structure. Consequently, they smoothly extend to the 
larger class of LP-type problems [6j, with the same running time bound as 
above for concrete problems in this class, like finding the smallest enclosing 
ball of a set of n points in dimension d [3]. 

Both primary stages of Clarkson's algorithm are based on random sampling 
and are conceptually very simple. The main idea behind the use of ran- 
domness is that we can solve a subproblem subject to only a small number 
of (randomly chosen) constraints, but still have only few (of all) constraints 
that are violated by the solution of the subproblem. However, some extra 
machinery was originally needed to make the analysis go through. More 
precisely, in both stages there needed to be a check that the each individ- 
ual random choice was "good" in a certain sense. Then in the analysis one 
needed to make the argument that the bad cases do not occur too often. 
For the first stage, it was already shown by Gartner and Welzl that these 
extra checks can be removed The result is what we call the German Al- 
gorithm below. In this paper, we do the removal also for the second stage, 
resulting in the Swiss Algorithm. (The names come from certain aspects 
of German and Swiss mentality that are reflected in the respective algo- 
rithms.) We believe that the German and the Swiss Algorithm together 
represent the essence of Clarkson's approach. 

Violator spaces. Gartner, Matousek, Riist, and Skovroii proved that 
Clarkson's original algorithm is applicable in a still broader setting than 
that of LP-type problems: It actually works for the class of violator spaces 
[2]. At first glance, this seems to be yet another generalization to yet an- 
other abstract problem class, but as Skovroh has shown, it stops here: the 
class of violator spaces is the most general one for which Clarkson's algo- 
rithm is still guaranteed to work [8]. In a nutshell, the difference between 
LP-type problems and violator spaces is that for the latter, the following 
trivial algorithm may cycle even in the nondegenerate case: maintain the 
optimal solution subject to a subset B of the constraints; as long as there 
is some constraint h that is violated by this solution, replace the current 
solution by the optimal solution subject to -B U {h}, and repeat. Examples 
of such cyclic violator spaces can be found in |8]. For a very easy and 
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intuitive example see also p]. 

It was unknown whether the analysis of the German Algorithm (the stripped- 
down version of Clarkson's first stage) also works for violator spaces. For 
LP-type problems, the analysis is nontrivial and constructs a "composite" 
LP-type problem. Here we show that this can still be done for violator 
spaces, in essentially the same way. 

For the Swiss Algorithm (the stripped-down version of Clarkson's second 
stage), we provide the first analysis at all. The fact that it works in the 
fully general setting of violator spaces comes naturally. 
The main difference of the German and the Swiss algorithm compared to 
their original formulations is the following. In both stages, at some point, 
Clarkson's algorithm checks how many violated constraints some random 
sample of constraints produces. If there are too many, then the algorithm 
discards the sample and resamples. The reason for this is that the analysis 
requires a bound on the number of violators in each step. We essentially 
show that this bound only needs to hold in expectation (and does so) for 
the analysis to go through. So the checks that we mentioned before are 
only an analytical tool, and not necessary for the algorithms to work. 
Let us point out that no subexponential algorithm for finding the basis 
(i.e. "solution") of a violator space is known. Therefore, we can only employ 
brute force to "solve" small violator spaces. Note that, e.g., in the context of 
linear programming, finding a basis means identifying the constraints which 
are tight at an optimal point. We call this the Brute Force Algorithm (BFA). 
Hence, the resulting best worst-case bound known degrades to 0{(fn + 
f{d)), where / is some exponential function of d. In this paper, we will not 
investigate this point further and use BFA as a black box. 

The German Algorithm (GA). Let us explain the algorithm for the 
problem of finding the smallest enclosing ball of a set of n points in (this 
problem fits into the violator space framework). The algorithm proceeds 
in rounds and maintains a working set G, initialized with a subset i? of r 
points drawn at random. In each round, the smallest enclosing ball of G is 
being computed (by some other algorithm). For the next round, the points 
that are unhappy with this ball (the ones that are outside) are being added 
to G. The algorithm terminates as soon as everybody is happy with the 
smallest enclosing ball of G. 

The crucial fact that we reprove below in the violator space framework is 
this: the number of rounds is at most d + 2, and for r ^ dy/n, the expected 
maximum size of G is bounded by 0{d\/n). This means that GA reduces a 
problem of size n to d + 2 problems of expected size 0{dy/n). We call this 
the German Algorithm, because it takes - typically German - one decision 
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in the beginning which is then efficiently pulled through. 

The Swiss Algorithm (SA). Like GA, this algorithm proceeds in rounds, 
but it maintains a voting box that initially contains one slip per point. In 
each round, a set of r slips is drawn at random from the voting box, and 
the smallest enclosing ball of the corresponding set R is computed (by some 
other algorithm). For the next round, all slips are put back, and on top 
of that, the slips of the unhappy points are being doubled. The algorithm 
terminates as soon as everybody is happy with the smallest enclosing ball 
of the sample R. 

Below, we will prove the following: if r ^ (P, the expected number of 
rounds is O(logn). This means that SA reduces a problem of size n to 
O(logn) problems of size O^d"^). We call this the Swiss Algorithm, because 
it takes - typically Swiss - many independent local decisions that magically 
fit together in the end. 

Hypercube partitions. A hypercube partition is a partition of the ver- 
tices of the hypercube such that every element of the partition is the set of 
vertices of some subcube. It was known that every nondegenerate violator 
space induces a hypercube partition [7', f5]. We prove here that also the 
converse is true, meaning that we obtain an alternative characterization of 
the class of violator spaces. While this result is not hard to obtain, it may 
be useful in the future for the problem of counting violator spaces. Here, 
the initial bounds provided by Skovron are still the best known ones [8]. 

Applications. We would love to present a number of convincing applica- 
tions of the violator space framework, and in particular of the German and 
the Swiss Algorithm for violator spaces. Unfortunately, we cannot. There 
is one known application of Clarkson's algorithm that really requires it to 
work for violator spaces and not just LP-type problems [2|; this applica- 
tion (solving generalized P-matrix linear complementarity problems with a 
fixed number of blocks) benefits from our improvements in the sense that 
now also the German and the Swiss Algorithm are applicable to it (with 
less random resources than Clarkson's algorithm). 

Our main contributions are therefore theoretical: we show that Clarkson's 
second stage can be simplified (resulting in the Swiss Algorithm), and this 
result is new even for LP-type problems and linear programming. The fact 
that Clarkson's first stage can be simplified (resulting in the German Al- 
gorithm) was known for LP-type problems; we extend it to violator spaces, 
allowing the German Algorithm to be used for solving generalized P-matrix 
linear complementarity problems with a fixed number of blocks. 
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We believe that our results are significant contributions to the theory of 
abstract optimization frameworks themselves. We have now arrived at a 
point where Clarkson's algorithm has been shown to work in the most 
general abstract setting that is possible, and in probably the most simple 
variant that can still successfully be analyzed. 

2 Prerequisites 

2.1 The Sampling Lemma 

The following lemma is due to Gartner and Welzl in and was adapted 
to violator spaces in We repeat it here for the sake of completeness, 
and because its proof and formulation are very concise. Let S" be a set of 
size ra, and (p : 2^ ^ R a function that maps any set C 5* to some value 
(p{R)- Define 

V(i?) := {seS\R\^{RU{s})^^{R)}, (1) 
X{R) := {seR\^{R\{s})j^^m- (2) 

V(_R) is the set of violators of R, while X(i?) is the set of extreme elements 
in R. Obviously, 

s violates R 4^ s is extreme in i? U {s}. (3) 

For a random sample R of size r, i.e., a set R chosen uniformly at random 
from the set (^) of all r-element subsets of S, we define random variables 
Vj. : i? I— » |V(i?)| and : R ^ |X(_R)|, and we consider the expected values 

Vr := E[V,], 

Xr ■= E[Xr]. 

Lemma 2.1 (Sampling Lemma, [H [2]). For < r < n, 

n — r r + 1 

Proof. Using the definitions of Vr and well as ([3]), we can argue as 

follows: 
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^ ^f f. = [s violates R] 



[s is extreme in U {s}] 
is extreme in Q] 

r + 1/ 

Here, [■] is the indicator variable for the event in brackets. Finally, (^"^) / (") 
(n-r)/(r + l). □ 



2.2 Violator Spaces 

Definition 2.2. A violator space is a pair (if, V), where H is a finite set 
and V is a mapping 2^ — > 2^ such that the following two conditions are 
fulfilled. 

Consistency: G fl V(G') = holds for all G C H, and 
Locality: for all F C G C H , where G H V(F) = 0, 

we have V(G') = V(F). 

Lemma 2.3 (Lemma 17, [2j). Any violator space (-ff, V) satisfies mono- 
tonicity defined as follows: 

Monotomcity: V(F) = V(G') «mp/«e5 V(E) = V(F) = V(G') 
for all sets F C E CG C H. 

Proof Assume \/{E) ^ V(F), V(G'). Then locality yields ij) ^ E n V(F) = 
E n V(G) which contradicts consistency. □ 

Definition 2.4. Consider a violator space {H,\/). 

(i) We say that B C H is a basis if for all proper subsets F C B we have 
B n V(F) 7^ 0. For G C H , a basis of G is a minimal subset B of G 
with y{B) = V(G). A basis in {H, V) is a basis of some set G C H . 

(a) The combinatorial dimension of {H, V), denoted by dim(if, V), is the 
size of the largest basis in {H, V) . 

(Hi) {H,V) is nondegenerate if every set set G C H, \G\ > dim(ii', V), 
has a unique basis. Otherwise {H, V) is degenerate. 
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Observe that a minimal subset B CI G with V(i?) = \I{G) is indeed a 
basis: Assume for contradiction that there is a set F C -B such that B fl 
V(F) = 0. Locality then yields \I{B) = V(F) = V(G), which contradicts 
the minimality of B. Also, note that, because of consistency, any basis B 
of H has no violators \/{H) = V(5) = 0. 

Corollary 2.5 (of Lemma l2.ip . Let {H, V) be a violator space of combinato- 
rial dimension d, and \H\ = n. If we choose a subset R H , \R\ = r < n, 
uniformly at random, then 

71 — T 

EMR)\]<d^. 

Proof. The corollary follows from the Sampling Lemma 12.11 with the ob- 
servation that \X{R)\ <d,WRCH. □ 

3 Clarkson's Algorithm Revisited 

Clarkson's algorithm can be used to compute a basis of some violator space 
(i?, V), n = \H\. It consists of two separate stages and the Brute Force 
Algorithm (BFA). The results about the running time and the size of the 
sets involved is summarized in Theorem 13.61 and Theorem I3.12[ 
The main idea of both stages (GA and SA) is the following: We draw a 
random sample i? C if of size r = \R\ and then compute a basis of R using 
some other algorithm. The crucial point here is that r <^ n hopefully. 
Obviously, such an approach may fail to find a basis of if, and we might 
have to reconsider and enter a second round. That is the point at which 
GA and SA most significantly differ. 

In both stages we assume that the size of the ground set, i.e., n, is larger 
than r, such that we can actually draw a sample of that size. We can 
assume this w.l.o.g., because it is easy to incorporate an if statement at 
the beginning that directly calls the other algorithm should n be too small. 

3.1 The German Algorithm (GA) 

This algorithm works as follows. Let (-ff, V) be a violator space, \H\ = n, 
and dim(ii', V) = d. We draw a random sample R C H , r = d^/njl., only 
once, and initialize our working set G with R. Then we enter a repeat 
loop, in which we compute a basis B oiG and check whether there are any 
violators in H. If no, then we are done and return the basis B. If yes, then 
we add those violators to our working set G and repeat the procedure. 
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The analysis will show that (i) the number of rounds is bounded by d + 1, 
and {ii) the size of G in any round is bounded by 0{dy/n). See Theorem 

m 

Algorithm 1: GA(iJ, V) 

input : Violator space V), \H\ = n, and dim(iJ, V) = d 
output: A basis B of {H, V) 

r d^yn/2; 

Choose R with \R\ = r , R C H u.a.r.; 

G < — R] 

repeat 

B ^ SA(G, V|^); 

G^GU\/{B); 
until V(5) = ; 
return B 



We will adopt some useful notations which we will use in the following 
proofs. First, let us point out that the notation V|^ refers to the violator 
mapping restricted to some set F C H. 

Definition 3.1, For i > 0, by 

B^S,vi\ andGf 

we denote the sets B, V(i?), and G computed in round i of the repeat loop 
above. Furthermore, we set G^^ := R, while B^^ and V^^ are undefined. 
In particular, we have that B^^ is a basts ofG%~^\ andVl^^ = ^iG^^'^^). If 
the algorithm performs exactly i rounds, sets with indices i > i are defined 
to be the corresponding sets in round i. 

The next one is an auxiliary lemma that we will need further on in the 
analysis. It is a generalization of the fact that there is at least one element 
of the basis of H found as a violator in every round (see also Lemma 13^ . 

Lemma 3.2. For j <i<l,B^j^r\ V^'^ ^ 0. 

Proof. Assume that b''^ fl V^"^ = 0. Together with consistency, fl 
vj^-'^ = 0, this implies 

(5« U V Vi^) = 0. 
Now, applying locality and the definition of basis, we get 

y{Bl^UG^~'^) = v!^^ = V{B^^). (4) 
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On the other hand, since V^'^ = V(5{J^) and ^JJ^ C U G^^"^^ C G^^^^, 
we can apply monotonicity and derive 

F«=V(i?«)=V(i?«UGr^)). (5) 

Note that V{B^^^) C G^^~^\ because G always contains the violators from 
previous rounds. Additionally, by equations (Jlj) and ([5]) we have that 
V^*) = V(5g^ U G^f^^) = y{B^^^). Thus, we can build a contradiction 
of consistency, 

gV^ n K« D v(i?(^)) n vH^ = y{B^£) + 0. 

The last inequality holds because j is not the last round. □ 

The following lemma is the crucial result that lets us interpret the devel- 
opment of the set G in Algorithm [T] as a violator space itself. 

Lemma 3.3. Let {H, V) be a violator space of combinatorial dimension d. 
For any subset R <Z H define 

T{R) :=(l4^\...,\/f ). 

Using this we can define a new violator mapping as follows, 

V{R) := {h e H\R I T{R) ^ T{R U {h})}. 
Then the following statements are true: 

(i) (if, V) is a violator space of combinatorial dimension at most {^^^^ ■ 

(ii) The set V'(i?) is given by 

V{R) = V^'^ U . . . U V^''^ = Gf\R. 

(iii) // (if, V) is nondegenerate, then so is {H, V) . 

In order to prove Lemma [3^3] we first need an auxiliary claim. Note that U 
denotes disjoint union. 

Claim 3.4. Let Q be any set with Q = R U T C H and i < d. If 



then 
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Proof of Claim\3^ We prove the claim by induction on i. First, if z = 
the precondition reads V(Q) = V(i?). It follows that G'^q^ = QU V(Q) = 

(i? U T) U V(i?) = G^J^ U T. 

Suppose the claim is true for j < i. From Vq~^^^ = vj^~^^^ we can deduce 

= Gg^ U V^'+'^ = (G« U T) U ^4'+') = U T. 

□ 

Before we proceed to the proof of Lemma 13.31 let us first state the conse- 
quences, which we obtain by applying Lemma [2TT] to the violator space that 
we constructed. 

Theorem 3.5 (Theorem 5.5 of For R C H with \H\ = n, and a 
random sample of size r, 



d + l\n — r 
2 ) r + 1 



Choosing r = d^/ri/2 yields 

E[\Gf\]<2{d + l)^. 

Proof of Theorem 1 3'. 51 The first inequality directly follows from the sam- 
pling lemma (Lemma 12. ip . applied to the violator space (if, V), together 
with part (ii) of Lemma 13^31 The second inequality follows from plugging 
in the value for r. □ 

Let us now come back to the Lemma. 

Proof of Lemma \3.3[ 

Proof of (i). We first need to check consistency and locality as defined in 
Definition [221 

Consistency is easy, by the definition of V. Since the violators of R C H 
are chosen from H\R exclusively, we can be sure that R fl V'(i?) = for 
all R. 

Let us recall what locality means. For sets i? C Q C if , if Q fl V'(i?) = 0, 
then V'((5) = V'(i?). This we are going to prove by induction on the size 
of Q\R. If |Q\i?| = 0, then the two sets are the same, and locality is 
obviously fulfilled. Now, suppose that \Q\R\ = i and locality is true for 
any smaller value j < i. Consider some set S fulfilling R C S C Q and 
Q = SiJ {q}. First note that, if Q n V{R) = 0, then also S n V'(i?) = 0. 
Therefore, the precondition for the induction hypothesis is fulfilled, and we 
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can conclude that V'(i?) = V'(S'). Bearing this in mind, we can make the 
following derivation: 



Q n V'(i?) = ^ Q n V{S) 

q&Q 



Def. m 
Def. GJ 



q ^ V'(5) 
r(^) = T{S U {q}) = TiQ) 

y'is) = v{Q) 

V{R)=ViQ). 



That shows the locality of the violator space (if, V). 

We still have to show that {H, V) has combinatorial dimension at most 

('^+^). To this end we prove that \/'{Br) = V'(i?), where 



Bn:=Rn[jB 



{i) 
R 

i=l 



Note that B^^ as we will show in (iii), is in fact the unique basis of the set 
R <^ H. By bounding the size of B^ we therefore bound the combinatorial 
dimension of {H,V). Equivalent to \/'{Br) = V'(i?) we show that V^^j = 

Vji^\ for l<j<d, using induction on j . For j = 1 we get 
V(i?)=V(5«U(i?\i?^))=V(i?^), 

because R\Bji is disjoint from B^j^\ the basis of R. Therefore, R\Bji = 
R \ UiLi B^R can be removed from R without changing the set of violators. 
Now assume that the statement holds for ?' < — 1 and consider the case 
J = d. By Claim [331 we get Cg"^^ = G^'^^^ U Since R\Bn is 

disjoint from the basis B^^^ of G^"^'' it follows that 

Vl^) = y{Gt'^) = ViGt'^ U {R\Bn)) = ^{0^'^) = V^^^. 
To bound the size of Br, we observe that 

|i?nsg^| <d+l-i, 
for all z < £ (the number of rounds in which V(i?) 7^ 0). This follows 



from Lemma I3.2[ B^^ has at least one element in each of the i — 1 sets 



V^, . . . , vj^ ^\ which are in turn disjoint from R. Hence we get 



B„\<±\RnB'i'\<('Y) 
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Proof of (ii). We show that if some constraint g G is in V'(i?) then it 
is also in vj^^ for some 1 < i < d. On the other hand if q ^ then q 

is not in any of the V^*"*, 1 < i < d. This proves the statement of (ii). 
Assume q G V'(i?) and let Q := RU{q}. Consider the largest index i < d—1 
such that 

Note that such an index i must exist, because V'(_R) 7^ V'(Q), which sim- 
ply follows from q G V'(i?) and g ^ V'(Q). Then, from Claim [X4l it fol- 
lows that Gq^^^ = G^^^^ U {g}, and by assumption on i we know that 
_^ ^ Therefore, by the contrapositive of locality, we conclude 

U {q}) n V(g2+'^) ^ 0. This means that q G V(Gi5+'^) = 
because otherwise the consistency of G^^^^ would be violated. 
On the other hand, if g ^ V'(_R), then V'(i?) = V'((5), or equivalently 
V^^ = for 1 < 2 < rf. However, because {H, V) is consistent it follows 
that g ^ Vq*\ and therefore g ^ V^*"*, for 1 < i < d. 

Proof of (iii). Nondegeneracy of {H, V) follows if we can show that every 
set C if has the set Br as its unique basis. To this end we prove that 
whenever we have L C R with V'(L) = V'(_R), then Br C L. 
Fix L C with V'(L) = V'(i?), i.e.. 



Claim [3741 then implies 



of = Gf U Q<i<d, 

and the nondegeneracy of {H, V) yields that G^^ and G^^ have the same 
unique basis -B/j^^\ for all < i < c?. Note that B^^^^ is indeed contained 
in Gf, because V(Gi;^) = \l{G^ll) = V{Gf\J{R\L)) = V{bI,^^^) for < 
i < d. That means, if there exists a basis of G^l\ that by definition would 
also be a basis of G^^ , but distinct from -B^^^"*, nondgeneracy is violated. 
It follows that G^^ contains 



i=l 



SO L contains 

Ln[jB^^ = Rn[jB^^ 

i=l i=l 
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The latter equality holds because R\L is disjoint from Gj^ , thus in partic- 
ular from the union of the . □ 

Theorem 3.6. Let {H, V) be a violator space of combinatorial dimension 
d, and n = \H\. Then the algorithm GA computes a basis of {H,\/) with 
at most d + 1 calls to SA, with an expected number of at most 0{d^/n) 
constraints each. 

Proof. According to Lemma 13.21 (and maybe more intuitively according to 
Lemma lXSl) . in every round except the last one we add at least one element 
of any basis of {H, V) to G. Since the size of the basis is bounded by d we 
get that the number of rounds is at most d+1. Furthermore, according to 
Theorem 13.51 and our choice r = d^yn/2, the expected size of G will not 
exceed 2{d + \)\fnj2 in any round. □ 



3.2 The Swiss Algorithm (SA) 

The algorithm SA proceeds similar as the first one. Let the input be a 
violator space (if, V), \H\ = n, and dim(if, V) = d. 

First, let us (re)introduce the notation R^'^\ B^^\ and V^'^^ for i > 1, similar 
as in Definition 13.11 for the sets R, B and V(i?) of round i respectively. 
The set B^^ is a basis of R^^ and V^^ = V(i?(*)) = V(5(^)). Since we draw 
a random sample in every round it does not make sense to index the sets 
B(^) and by R, so we drop this subscript. 

After the initialization we enter the first round and choose a random sample 
R^^^ of size r = 2d'^ uniformly at random from H. Then we compute an 
intermediate basis B^^^ of the violator space {R^^\ '^Irw) by using BFA as 
a black box. In the next step we compute the set of violated constraints, 
i.e., V^^\ So far, it is the same thing as the first stage. But now, instead 
of enforcing the violated constraints by adding them to the active set, we 
increase the probability that the violated constraints are chosen in the next 
round. This is achieved by means of the multiplicity or weight variable /x. 

Definition 3.7. With every h E H we associate the multiplicity fit G N. 
For an arbitrary set F C H we define the cumulative multiplicity as 

For the analysis we also need to keep track of this value across different 
iterations of the algorithm. For i > we will use fj!'^ (and ijl^'^\F)) to 
denote the (cumulative) multiplicity at the end of round i. We define ix'^^ := 
1 for any h E H, and therefore jj,^^\F) = \F\. 
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Now back to the algorithm. To increase the probability that a constraint 
h e y^*^ is chosen in the random sample of round i + 1 we double the 
multiplicity of h, i.e., fi^i^'' = ^\ 

The multiplicities determine how the random sample R^^^^^ is chosen. To 
this end we construct a multiset to which we add /x^*'* copies of every 

element h & H. To simplify notation, let us for a moment fix the round 
i + 1 and drop the corresponding superscript. 

We define the function : 2^ — > 2''^ as the function that maps a set of 
elements from H to the set of corresponding elements in H, i.e., ioi F C H 

<t^{F):= U{/.i,...,V}, 

heF 

where the hj, 1 < j < Hh, are the distinct copies of h. For example, 
H — (j){H). Conversely, let : 2''^ — > 2''^ be the function that collapses a 
given subset of H to their original elements in H, i.e., for F C if, 

V^(F) :={he H \<p{{h})nF 

Reintroducing the superscript i + 1 we can simply say that we construct 
— (p[H) using the multiplicities from round i. The sample 7?'^*+-^' is 
then chosen u.a.r. from the r-subsets of ff'-*^^-'. In the following the multiset 
property will not be important any more and we can discard multiple entries 
to obtain = i^{R^'+^^). Note that 1 < < r. Then we continue 

as in round 1. Note that in the first round this is in fact equivalent to 
choosing an r-subset u.a.r. from if, because ji-^^ = 1 for all h G H. 
The algorithm terminates as soon as V^^^ = for some round i > 1 and 
returns the basis B^^\ 



Algorithm 2: Sk{H, V) 

input : Violator space (if, V), \H\ = n, and dim(ii, V) = d 
output: A basis B of {H, V) 

/ih i for all h & H; 

r ^ 2(P- 

repeat 

choose random R from H according to ii\ 

5^BFA(i?,V|J; 

Hh ^ 2/x/i for all h e V(i?); 
until V{B) = ; 
return B 
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Let us first discuss an auxiliary lemma very similar in flavour to Lemma 



Lemma 3.8 (Observation 22, 0). Let (-ff, V) be a violator space, F C 
G C H, and G fl V(F) ^ 0. Then G fl V(F) contains at least one element 
from every basis ofG. 

Proof. Since the proof is pretty short we repeat it here. Let B be some 
basis of G and assume that i?nGnV(F) = i?nV(F) = 0. From consistency 
we get F n V(F) = 0. Together this implies 

{BUF)n V(F) = 0. 

Applying locality and monotonicity, we get 

V(F) = V{BUF) = y{G), 

meaning that G n V(G') = G n V(F) = 0, a contradiction. □ 

The analysis of SA will show that the elements in any basis B of H will 
increase their multiplicity so quickly that they are chosen with high prob- 
ability after a logarithmic number of rounds. This, of course, means that 
the algorithm will terminate, because there will be no violators. Formally, 
we will have to employ a little trick though. We will consider a modifica- 
tion of SA that runs forever, regardless of the current set of violators! Let 
us call the modified algorithm SA_f orever. We call a particular round i 
controversial if V^"^^ ^ 0. Furthermore, let Gi be the event that the first i 
rounds are controversial in SA_f orever. 

Lemma 3.9. Let (-ff, V) be a violator space, \H\ = n, dim(ii/', V) = d, B 
any basis of H, and k E N some positive integer. Then, in SA_ forever, 
the following holds for the expected cumulative multiplicity of B after kd 
rounds, 

2^Pr[Cfc,]<E[^('='^)(5)]. 

Proof. In any controversial round. Lemma 13.81 asserts that B fl V^"^^ ^ 0. 
So, in every controversial round, the multiplicity of at least one element 
in B is doubled. Therefore, by conditioning on the event that the first kd 
rounds are controversial, there must be a constraint in B that has been 
doubled at least k times (recall that \B\ < d). It follows that E[/i('"^)(5)] = 
E[/x('='^)(5) I Gud] Pr[CfcJ + ^^^^^^\B) \ Pr^] > 2^ Pr[Cfc,]. □ 

Lemma 3.10. Let (i^, V) be a violator space, \H\ = n, dim (if , V) = d, B 
any basis of H , and A; G N some positive integer. Then, in SA_ forever, 
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the following holds for the expected cumulative multiplicity of B after kd 
rounds, 

Proof. Let us point out first, that the following analysis goes through for 
SA_forever as well as for SA, but to make it match Lemma [X9l we formu- 
lated it using the former. 

Note that E[/i('='^)(5)] < E[ij,^'"^\H)], because B C H. Therefore, if we 
show the upper bound for the latter expectation we are done. Let i := kd 
be the number of rounds, and A^^\F) := fi^'^^F) — the increase 

of multiplicity from one round to another, for any i > 1 and F C H. We 
write the expected weight of H after i rounds as the sum of the initial 
weight plus the expected increase in weight in every round from 1 to £, 

i 

E[^W(i7)] = E[^(°)(/J)] + J2m^HH)]- (6) 

i=l 

The first term is easy, E[fx^^\H)] = n, and the second term we write as a 
conditional expectation, assuming that the weight in round i — 1 was t, 

i e CO 

5^E[A«(iJ)] = 5^^E[AW(if)|/i(-i)(iJ) = t] Pr = t]. (7) 

i=l i=l t=0 

Now comes the crucial step. According to Lemma [2TT] we can upper bound 
E[A^^\H)\jj,^^~^\H) = t] by interpreting it as the expected number of vio- 
lators of a multiset extension of {H, V). To this end we construct a violator 
space V), where H^^^ = (p{H) using the multiplicities from round i — 

Let us fix round i and drop the superscript for the moment. For any F (1 H 
we define 

V(F) :=0(V(^(F))). 

We observe that {H, V) is indeed a violator space. For F <^ H, consistency 
is preserved, because from consistency of (-ff, V) it follows that (j){ip{F)) fl 
0(V('?/'(F))) = 0, and knowing F C (j){Tp[F)), we can conclude consistency 
of (^,V). Similarly, for F C G C iY, locality of (i/, V) tells us that if 
n (t){y{^{F))) = then (p{\/{ip{F))) = 0(V(^(G))), and knowing 
G C locality of {H, V) follows. 

The violator space we just constructed has the same ground set H by means 
of which we draw the random sample R in every round. By supplying a 
valid violator mapping we asserted that we can apply the sampling lemma 
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to that process. Some thinking reveals that d = dim(if, V) = dim{H, V) 
(even though we introduced degeneracy), and we can conclude 

E[A«(if)|/i(^-i)(if) =t]= E[|V(i?«)|] < (8) 

Therefore we get the simplified expression 



oo 



t - r 



i=l t=0 ■ ~'~ 



d 



n 

i=l \' ' ^ t=o 



dr 



oo 



-^Pr[/i(^-^)(i/) = t] 



r + _ 

t=o 



i=l 



The first line is derived from ([6]), (JT]), and (l8|). The rest is routine. Dropping 
the last term we get the following recursive equation, 

E[/.W(i/)]<n + -^|jE[^« 

which easily resolves to the claimed bound. □ 
Using i = kd, and combining Lemmata 13.91 and 13.101 we now know that 

2^^ PilCA <n(l + - 
\ r 

This inequality gives us a useful upper bound on Pr[C^], because the left- 
hand side power grows faster than the right-hand side power as a function 
of i, given that r is chosen large enough. 

Let us choose r = cd"^ for some constant c > loga e ~ 1.44. We obtain 

using 1 + X < = 2^'°S2<^ for all x. This further gives us 

Pr[C^] < na\ (9) 
a = a{d,c) = 2(i°S2e-^)/(crf) < i_ 
This implies the following tail estimate. 
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Lemma 3.11. For any (3 > 1, the probability that SA_ forever starts with 
at least \P log^/^ n\ controversial rounds is at most 

Proof. The probability for at least this many leading controversial rounds 
is at most 

□ 

We can also bound the expected number of leading controversial rounds in 
SA_f orever, and this bounds the expected number of rounds in SA, because 
SA terminates upon the first non-controversial round it encounters. 

Theorem 3.12. Let {H, V) be a violator space, \H\ = n, and dim {H, V) = 
d. Then the algorithm SA computes a basis of H with an expected number 
of at most 0{d\nn) calls to BFA, with at most 0{d'^) constraints each. 

Proof. By definition of C^, the expected number of leading controversial 
rounds in SA_f orever is 

i>i 

For any /5 > 1, we can use ([9]) to bound this by 

r/31ogi/„n]-l 

1 + n a' = r/31ogi/,nl -1 + n " 

1=1 £=r/31ogi/,nl 

< /?logi/,n+^ 

= /31ogi/^n + o(l). 

This upper bounds the expected number of rounds in SA. In every round 
of SA one call to BFA is made, using cd"^ constraints, where c > log2e is 
constant. □ 

4 Hypercube Partitions 

Let if be a finite set. Consider the graph on the vertices 2^, where two 
vertices F, G are connected by an edge if they differ in exactly one element, 
i.e., G = FU {/i}, h E H. This graph is a hypercube of dimension n = \H\. 
For the sets ACBCH.we define [A, B] := {C C H \ A C C C B} and 



18 



call any such [A, B] an interval. A hypercube partition is a partition V of 
2^ into (disjoint) intervals. 

Let {H, V) be a violator space. We call two sets F,G H equivalent if 
V(F) = V(G'), and let H be the partition of 2^ into equivalence classes 
w.r.t. this relation. We call Ti. the violation pattern of (if, V). 
Before we formulate and prove the Hypercube Partition Theorem, we need 
to introduce some notation. We extend the notion of violator spaces by 
the concept of anti-basis. 

Definition 4.1. Consider a violator space {H, V). We say that B C H is 
an anti-basis if for all proper supersets F D B we have V(i?) fl F 7^ 0. An 
anti-basis of G C H is a maximal superset B of G with V(i?) = V(G'). 

Note that a maximal superset B oi G such that V(_B) = V(G) is indeed 
an anti-basis of G. Suppose that there is a set B' D B with V(5) n B' = 
0. Locality then decrees that V(i?) = V(S'), but this contradicts the 
maximality of B. 

Lemma 4.2. Consider the violator space {H,\/). For any G C H there is 
a unique anti-basis Bq of G. 

Proof. Suppose that there exist two distinct anti-bases B and B' of G. 
Because of V(5) = V(S') and consistency we have that {B U B') n V(S) = 
{B U B') n y{B') = 0. Therefore, by locality, V(S U B') = V(S') = V(S). 
Since B and B' are distinct, it cannot be that B\B' = and B'\B = at 
the same time. Then, in any case, \B U B'\ > \B\ or \B U B'\ > \B'\ holds, 
which contradicts the maximality of the anti-bases. □ 

Corollary 4.3. Let (ii, V) be a violator space, G C H, Bq any basis ofG, 
and Be the unique anti-basis of G. Then for any set F , Bq C F C Bq, F 
and G are equivalent, i.e., V(F) = V(G'). 

Proof. This is an immediate consequence of monotonicity (Lemma l2.3l) . □ 
Lemma 4.4. Ti completely determines (if, V). 

Proof. Let G ^ H. There is a unique anti-basis Bg of G, meaning that 
in 7i, there is a unique inclusion-maximal superset of G in the same class 
of the partition. This implies that V(G) = ^iBc = H\ Bg), so {H, V) is 
reconstructible from Ti. □ 

Lemma 4.5. // {H,\/) is nondegenerate (unique bases), then Ti is a hy- 
percube partition. 
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Proof. We first show that V(5) = V(5') implies V(5 n B') = V(5 U B') = 
y{B). The latter has been shown for the existence of a unique anti-basis. 
For the former, we argue as follows. Let A be the unique basis of 5 U B'. 
Then \/{A) = V(5) = \/{B'). But then A is also the unique basis of B and 
B'. It follows that AC BnB', and by locality we get V(A) = V(5 n B') = 
V(E). 

This argument implies that any partition class C is contained in the interval 
[flcec Ucec other hand, the whole interval is contained in C 

by locality, so we are done. □ 

Lemma [441 and 1451 together imply that there is an injective mapping from 
the set of nondegenerate violator spaces to the set of hypercube partitions. 
It remains to show that the mapping is surjective. 

Theorem 4.6. Any hypercube partition V is the violation pattern of some 
nondegenerate violator space {H, V) 

Proof. Let G C H, and let [B, B'] be the interval containing G. We define 
V(G) = H \ B' and claim that this is a nondegenerate violator space with 
violation pattern V. The latter is clear, since V(F) = V(G') if and only 
if F,G C [B,B']. To see the former, we observe that consistency holds 
because of G C B'. To prove locality, choose G C G' with H \ B' = 
V(G') n G" = 0. In particular, G' C B', so G" is also in [B, B'] and we get 
vic) = V(G') by definition of V. 

It remains to show that the violator space thus defined is nondegenerate. 
Let B,B' be two sets with V(_B) = V(_B'), meaning that they are in the 
same partition class of V. But then B n B' is also in the same class, and 
we get y{B) = \/{B fl B'). This implies existence of unique bases. □ 

5 Conclusion 

We analyzed Clarkson's algorithm in what we believe to be its most general 
as well as natural setting. Additionally, we have given the equivalence 
between non-degenerate violator spaces and hypercube partitions, which 
could help identifying further applications in computational geometry as 
well as other fields of computer science. Another major challenge will 
be to establish a subexponential analysis for the third stage, BFA, in the 
framework of violator spaces (as there already exists for LP's and LP-type 
problems), in order to get stronger bounds when the dimension is only 
moderately small. 
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